Random Sampling with Removal
نویسندگان
چکیده
Random sampling is a classical tool in constrained optimization. Under favorable conditions, the optimal solution subject to a small subset of randomly chosen constraints violates only a small subset of the remaining constraints. Here we study the following variant that we call random sampling with removal: suppose that after sampling the subset, we remove a fixed number of constraints from the sample, according to an arbitrary rule. Is it still true that the optimal solution of the reduced sample violates only a small subset of the constraints? The question naturally comes up in situations where the solution subject to the sampled constraints is used as an approximate solution to the original problem. We study random sampling with removal in a generalized, completely abstract setting where we assign to each subset R of the constraints an arbitrary set V (R) of constraints disjoint from R; in applications, V (R) corresponds to the constraints violated by the optimal solution subject to only the constraints in R. Furthermore, our results are parametrized by the dimension δ, i.e., we assume that every set R has a subset B of size at most δ with the same set of violated constraints. This is the first time this generalized setting is studied. In this setting, we prove matching upper and lower bounds for the expected number of constraints violated by a random sample, after the removal of k elements. For a large range of values of k, the new upper bounds improve the previously best bounds for LPtype problems, which moreover had only been known in special cases. We show that this bound on special LP-type problems can be derived in the much more general setting of violator spaces, and with very elementary proofs.
منابع مشابه
Simplifying Boolean Networks
This paper explores the compressibility of complex systems by considering the simplification of Boolean networks. A method, which is similar to that reported by Bastolla and Parisi,4,5 is considered that is based on the removal of frozen nodes, or stable variables, and network “leaves,” i.e. those nodes with outdegree = 0. The method uses a random sampling approach to identify the minimum set o...
متن کاملImprovement of Chemical Named Entity Recognition through Sentence-based Random Under-sampling and Classifier Combination
Chemical Named Entity Recognition (NER) is the basic step for consequent information extraction tasks such as named entity resolution, drug-drug interaction discovery, extraction of the names of the molecules and their properties. Improvement in the performance of such systems may affects the quality of the subsequent tasks. Chemical text from which data for named entity recognition is extracte...
متن کاملInference about density and temporary emigration in unmarked populations.
Few species are distributed uniformly in space, and populations of mobile organisms are rarely closed with respect to movement, yet many models of density rely upon these assumptions. We present a hierarchical model allowing inference about the density of unmarked populations subject to temporary emigration and imperfect detection. The model can be fit to data collected using a variety of stand...
متن کاملUK experience in the monitoring and control of lead in drinking water.
At the zonal scale (e.g. a city or town), random daytime (RDT) sampling succeeded in demonstrating both the need for corrective action and the benefits of optimised orthophosphate dosing for plumbosolvency control, despite initial concerns about sampling reproducibility. Stagnation sampling techniques were found to be less successful. Optimised treatment measures to minimise lead in drinking wa...
متن کاملGeneration and Analysis of Constrained Random Sampling Patterns
Random sampling is a technique for signal acquisition which is gaining popularity in practical signal processing systems. Nowadays, eventdriven analog-to-digital converters make random sampling feasible in practical applications. A process of random sampling is defined by a sampling pattern, which indicates signal sampling points in time. Practical random sampling patterns are constrained by AD...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016